Using Fault Injection to Analyze the Scope of Error Propagation in Linux
نویسندگان
چکیده
Operating systems (OSes) are crucial for achieving high availability of computer systems. Even if applications running on an operating system are highly available, a bug inside the kernel may result in a failure of the entire software stack. The objective of this study is to gain some insight into the development of the Linux kernel that is more resilient against software faults. In particular, this paper investigates the scope of error propagation. The propagation scope is process-local if the erroneous value is not propagated outside the process context that activated it. The scope is kernel-global if the erroneous value is propagated outside the process context that activated it. The investigation of the scope of error propagation gives us some insight into 1) defensive coding style, 2) reboot-less rejuvenation, and 3) general recovery mechanisms of the Linux kernel. For example, if most errors are process-local, we can rejuvenate the kernel without reboots because the kernel can be recovered simply by killing faulty processes. To investigate the scope of error propagation, we conduct an experimental campaign of fault injection on Linux 2.6.18, using a kernel-level fault injector widely used in the OS community. Our findings are (1) our target kernel (Linux 2.6.18) is coded defensively. This defensive coding style contributes to lower rates of error manifestation and kernel-global errors, (2) the scope of error propagation is mostly process-local in Linux, and (3) global propagation occurs with low probability. Even if an error corrupts a global data structure, other processes merely access to them.
منابع مشابه
Is Linux Kernel Oops Useful or Not?
Linux kernel oops is invoked when the kernel detects an erroneous state inside itself. It kills an offending process and allows Linux to continue its operation under a compromised reliability. We investigate how reliable Linux is after a kernel oops in this paper. To investigate the reliability after a kernel oops, we analyze the scope of error propagation through an experimental campaign of fa...
متن کاملA generalized ABFT technique using a fault tolerant neural network
In this paper we first show that standard BP algorithm cannot yeild to a uniform information distribution over the neural network architecture. A measure of sensitivity is defined to evaluate fault tolerance of neural network and then we show that the sensitivity of a link is closely related to the amount of information passes through it. Based on this assumption, we prove that the distribu...
متن کاملIntroducing of the propagation criterion and investigation of that for near vacuum chamber pressures for multi hole gasoline direct injector spray
with developing of optical technologies, using of different imaging methods to study various properties of gasoline direct injection spray is under development. Scope of this article is to study the effect of overall spray angle and spray projected area for determination of spray propagation and investigate the effectiveness of this criterion. The injector was a multi-hole direct injection in...
متن کاملComparing Fail-Sailence Provided by Process Duplication versus Internal Error Detection for DHCP Server
This paper uses fault injection to compare the ability of two fault-tolerant software architectures to protect an application from faults. These two architectures are Voltan, which uses process duplication, and Chameleon ARMORs, which use self-checking. The target application is a Dynamic Host Configuration Protocol (DHCP) server, a widely used application for managing IP addresses. NFTAPE, a s...
متن کاملExperiment illustrating Failure Acceleration and Error Propagation in Fault - Injection
A system level fault injection experiment measuring changes in probability of failure and error propagation as a function of failure acceleration. It demonstrates that: (1) Probability of failure increases with failure acceleration, and (2) Error propagation decreases with failure acceleration .
متن کامل